Using Genetic Programming to Evaluate the Impact of Social Network Analysis in Author Name Disambiguation

Authors

  • Felipe Hoppe Levin
  • Carlos Alberto Heuser
Abstract

In digital libraries, which have become extremely popular in the scientific community, people often want to find publications by an author using the author's name as a query. However, since an author may have many denominations and one denomination may refer to many authors, name searches may return ambiguous results. Several studies have been developed to tackle this problem. Recently, the use of social networks has been studied in author name disambiguation. In this article, we use a machine learning approach based on Genetic Programming to evaluate the impact of social network analysis in author name disambiguation. Through experiments using real-world data, we show that social network analysis greatly improves the quality of results. We also demonstrate that our approach is able to compete with state-of-the-art techniques.

Similar resources

Evaluating the Use of Social Networks in Author Name Disambiguation in Digital Libraries

Digital libraries have become an important source of information for scientific communities. However, by gathering data from different sources, the problem of duplicate and ambiguous information about author names arises. Traditional methods of name disambiguation use syntactic attribute information. However, recently the use of relationship networks has been studied in data deduplication. This...

Improving the Accuracy of Author Name Disambiguation Using Agglomerative Clustering

Today, digital libraries are important academic resources containing millions of citations and essential bibliographic information such as titles, authors' names, and publication venues. From the perspective of knowledge accumulation management, the ability to search quickly and accurately for the desired content is of great importance. The complexity and similarity in these resources cause many challenges and...

Investigating Association between Social influence, Productivity, and Performance in Co-author Network of Researchers in Medical Ethics

The purpose of this research is to investigate the association between social influence, productivity, and performance among researchers in the field of medical ethics. The research used common scientometric methods, namely co-authorship and network analysis. The statistical population of the study consists of all articles published in journals in the field of medical ethics,...

Sustainability in paper industry closed-loop supply chain (case study: East Azerbaijan province, Iran)

Governments and customers are forcing paper manufacturers to become more sustainable. Nevertheless, there still exists a gap in the quantitative modeling of these issues. In this paper, this gap is covered by simultaneously considering economic, environmental and social impacts in the paper closed-loop supply chain network design. The proposed multi-objective, multi-echelon, multi-pro...

An Application of Genetic Network Programming Model for Pricing of Basket Default Swaps (BDS)

The credit derivatives market has experienced remarkable growth over the past decade. As such, there is a growing interest in tools for pricing of the most prominent credit derivative, the credit default swap (CDS). In this paper, we propose a heuristic algorithm for pricing of basket default swaps (BDS). For this purpose, genetic network programming (GNP), which is one of the recent evolutiona...

Publication date: 2010